智能论文笔记

PRVNet: A Novel Partially-Regularized Variational Autoencoders for Massive MIMO CSI Feedback

Mostafa Hussien , Kim Khoa Nguyen , Mohamed Cheriet

分类：机器学习

2020-11-09

在多输入的多输出频率划分双工（MIMO-FDD）系统中，用户设备（UE）将下行链路通道状态信息（CSI）发送到基础站以报告链接状态。由于MIMO系统的复杂性，发送此信息产生的高架对系统带宽产生负面影响。尽管在文献中已广泛考虑了这个问题，但先前的工作通常假定理想的反馈渠道。在本文中，我们介绍了PRVNET，这是一种受差异自动编码器（VAE）启发的神经网络体系结构，以压缩CSI矩阵，然后再将其发送回噪声通道条件下的基站。此外，我们提出了一种定制的损失功能，该功能最适合所解决的问题的特殊特征。我们还为学习目标引入了另外的正规化超参数，这对于实现竞争性能至关重要。此外，我们还提供了一种有效的方法，可以使用kl耗电来调整此超参数。实验结果表明，在无噪声反馈通道假设中，提出的模型优于基准模型，包括两个基于深度学习的模型。此外，提议的模型在不同的噪声水平下为加性白色高斯噪声反馈通道实现了出色的性能。

translated by 谷歌翻译

Fixed-budget online adaptive mesh learning for physics-informed neural networks. Towards parameterized problem inference

Thi Nguyen Khoa Nguyen , Thibault Dairay , Raphaël Meunier , Christophe Millet , Mathilde Mougeot

分类：机器学习

2022-12-22

Physics-Informed Neural Networks (PINNs) have gained much attention in various fields of engineering thanks to their capability of incorporating physical laws into the models. PINNs integrate the physical constraints by minimizing the partial differential equations (PDEs) residuals on a set of collocation points. The distribution of these collocation points appears to have a huge impact on the performance of PINNs and the assessment of the sampling methods for these points is still an active topic. In this paper, we propose a Fixed-Budget Online Adaptive Mesh Learning (FBOAML) method, which decomposes the domain into sub-domains, for training collocation points based on local maxima and local minima of the PDEs residuals. The stopping criterion is based on a data set of reference, which leads to an adaptive number of iterations for each specific problem. The effectiveness of FBOAML is demonstrated in the context of non-parameterized and parameterized problems. The impact of the hyper-parameters in FBOAML is investigated in this work. The comparison with other adaptive sampling methods is also illustrated. The numerical results demonstrate important gains in terms of accuracy of PINNs with FBOAML over the classical PINNs with non-adaptive collocation points. We also apply FBOAML in a complex industrial application involving coupling between mechanical and thermal fields. We show that FBOAML is able to identify the high-gradient location and even give better prediction for some physical fields than the classical PINNs with collocation points taken on a pre-adapted finite element mesh.

translated by 谷歌翻译

Contextual Explainable Video Representation:\\Human Perception-based Understanding

Khoa Vo , Kashu Yamazaki , Phong X. Nguyen , Phat Nguyen , Khoa Luu , Ngan Le

分类：计算机视觉

2022-12-12

Video understanding is a growing field and a subject of intense research, which includes many interesting tasks to understanding both spatial and temporal information, e.g., action detection, action recognition, video captioning, video retrieval. One of the most challenging problems in video understanding is dealing with feature extraction, i.e. extract contextual visual representation from given untrimmed video due to the long and complicated temporal structure of unconstrained videos. Different from existing approaches, which apply a pre-trained backbone network as a black-box to extract visual representation, our approach aims to extract the most contextual information with an explainable mechanism. As we observed, humans typically perceive a video through the interactions between three main factors, i.e., the actors, the relevant objects, and the surrounding environment. Therefore, it is very crucial to design a contextual explainable video representation extraction that can capture each of such factors and model the relationships between them. In this paper, we discuss approaches, that incorporate the human perception process into modeling actors, objects, and the environment. We choose video paragraph captioning and temporal action detection to illustrate the effectiveness of human perception based-contextual representation in video understanding. Source code is publicly available at https://github.com/UARK-AICV/Video_Representation.

translated by 谷歌翻译

Depth Perspective-aware Multiple Object Tracking

Kha Gia Quach , Huu Le , Pha Nguyen , Chi Nhan Duong , Tien Dai Bui , Khoa Luu

分类：计算机视觉

2022-07-10

本文旨在解决多个对象跟踪（MOT），这是计算机视觉中的一个重要问题，但由于许多实际问题，尤其是阻塞，因此仍然具有挑战性。确实，我们提出了一种新的实时深度透视图 - 了解多个对象跟踪（DP-MOT）方法，以解决MOT中的闭塞问题。首先提出了一个简单但有效的主题深度估计（SODE），以在2D场景中自动以无监督的方式自动订购检测到的受试者的深度位置。使用SODE的输出，提出了一个新的活动伪3D KALMAN滤波器，即具有动态控制变量的Kalman滤波器的简单但有效的扩展，以动态更新对象的运动。此外，在数据关联步骤中提出了一种新的高阶关联方法，以合并检测到的对象之间的一阶和二阶关系。与标准MOT基准的最新MOT方法相比，提出的方法始终达到最先进的性能。

translated by 谷歌翻译

An FPGA-based Solution for Convolution Operation Acceleration

Trung Dinh Pham , Bao Gia Bach , Lam Trinh Luu , Minh Dinh Nguyen , Hai Duc Pham , Khoa Bui Anh , Xuan Quang Nguyen , Cuong Pham Quoc

分类：人工智能 | 机器学习

2022-06-09

基于硬件的加速度是促进许多计算密集型数学操作的广泛尝试。本文提出了一个基于FPGA的体系结构来加速卷积操作 - 在许多卷积神经网络模型中出现的复杂且昂贵的计算步骤。我们将设计定为标准卷积操作，打算以边缘-AI解决方案启动产品。该项目的目的是产生一个可以一次处理卷积层的FPGA IP核心。系统开发人员可以使用Verilog HDL作为体系结构的主要设计语言来部署IP核心。实验结果表明，我们在简单的边缘计算FPGA板上合成的单个计算核心可以提供0.224 GOPS。当董事会充分利用时，可以实现4.48 GOP。

translated by 谷歌翻译

Self-supervised Domain Adaptation in Crowd Counting

Pha Nguyen , Thanh-Dat Truong , Miaoqing Huang , Yi Liang , Ngan Le , Khoa Luu

分类：计算机视觉

2022-06-07

自我训练的人群计数尚未得到专心探索，尽管这是计算机视觉中的重要挑战之一。实际上，完全监督的方法通常需要大量的手动注释资源。为了应对这一挑战，这项工作引入了一种新的方法，以利用现有的数据集，以地面真理来在人群计数中对未标记的数据集（称为域名适应）产生更强大的预测。尽管网络接受了标记的数据训练，但培训过程中还添加了来自目标域的标签的样品。在此过程中，除了平行设计的对抗训练过程外，还计算和最小化熵图。在shanghaitech，UCF_CC_50和UCF-QNRF数据集上进行的实验证明，在跨域设置中，我们的方法对我们的方法进行了更广泛的改进。

translated by 谷歌翻译

Collaborative Learning for Cyberattack Detection in Blockchain Networks

Tran Viet Khoa , Do Hai Son , Dinh Thai Hoang , Nguyen Linh Trung , Tran Thi Thuy Quynh , Diep N. Nguyen , Nguyen Viet Ha , Eryk Dutkiewicz

分类：机器学习

2022-03-21

本文旨在研究入侵攻击，然后为区块链网络开发新的网络攻击检测框架。具体来说，我们首先在实验室设计和实施区块链网络。该区块链网络将实现两个目的，即为我们的学习模型生成真实的流量数据（包括正常数据和攻击数据），并实施实时实验，以评估我们建议的入侵检测框架的性能。据我们所知，这是第一个在区块链网络中用于网络攻击的实验室中合成的数据集。然后，我们提出了一个新颖的协作学习模型，该模型允许区块链网络中的有效部署来检测攻击。提出的学习模型的主要思想是使区块链节点能够积极收集数据，从其数据中分享知识，然后与网络中的其他区块链节点交换知识。这样，我们不仅可以利用网络中所有节点的知识，而且还不需要收集所有原始数据进行培训，以便在常规的集中学习解决方案等集中式节点上进行培训。这样的框架还可以避免暴露本地数据的隐私以及过多的网络开销/拥堵的风险。密集模拟和实时实验都清楚地表明，我们提出的基于协作的入侵检测框架可以在检测攻击方面达到高达97.7％的准确性。

translated by 谷歌翻译

Physics-informed neural networks for non-Newtonian fluid thermo-mechanical problems: an application to rubber calendering process

Thi Nguyen Khoa Nguyen , Thibault Dairay , Raphaël Meunier , Mathilde Mougeot

分类：机器学习

2022-01-31

物理知识的神经网络（PINNS）由于能力将物理定律纳入模型，在工程的各个领域都引起了很多关注。但是，对机械和热场之间涉及耦合的工业应用中PINN的评估仍然是一个活跃的研究主题。在这项工作中，我们提出了PINNS在非牛顿流体热机械问题上的应用，该问题通常在橡胶日历过程中考虑。我们证明了PINN在处理逆问题和不良问题时的有效性，这些问题是不切实际的，可以通过经典的数值离散方法解决。我们研究了传感器放置的影响以及无监督点对PINNS性能的分布，即从某些部分数据中推断出隐藏的物理领域的问题。我们还研究了PINN从传感器捕获的测量值中识别未知物理参数的能力。在整个工作中，还考虑了嘈杂测量的效果。本文的结果表明，在识别问题中，PINN可以仅使用传感器上的测量结果成功估算未知参数。在未完全定义边界条件的不足问题中，即使传感器的放置和无监督点的分布对PINNS性能产生了很大的影响，我们表明该算法能够从局部测量中推断出隐藏的物理。

translated by 谷歌翻译

Deep Transfer Learning: A Novel Collaborative Learning Model for Cyberattack Detection Systems in IoT Networks

Tran Viet Khoa , Dinh Thai Hoang , Nguyen Linh Trung , Cong T. Nguyen , Tran Thi Thuy Quynh , Diep N. Nguyen , Nguyen Viet Ha , Eryk Dutkiewicz

分类：机器学习

2021-12-02

联邦学习（FL）最近成为网络攻击检测系统的有效方法，尤其是在互联网上（物联网）网络。通过在IOT网关中分配学习过程，FL可以提高学习效率，降低通信开销并增强网络内人检测系统的隐私。在这种系统中实施FL的挑战包括不同物联网中的数据特征的标记数据和不可用的不可用。在本文中，我们提出了一种新的协作学习框架，利用转移学习（TL）来克服这些挑战。特别是，我们开发一种新颖的协作学习方法，使目标网络能够有效地和快速学习来自拥有丰富标记数据的源网络的知识。重要的是，最先进的研究要求网络的参与数据集具有相同的特征，从而限制了入侵检测系统的效率，灵活性以及可扩展性。但是，我们所提出的框架可以通过在各种深度学习模型中交换学习知识来解决这些问题，即使他们的数据集具有不同的功能。关于最近的真实网络安全数据集的广泛实验表明，与基于最先进的深度学习方法相比，拟议的框架可以提高超过40％。

translated by 谷歌翻译

Multimodal Wildland Fire Smoke Detection

Siddhant Baldota , Shreyas Anantha Ramaprasad , Jaspreet Kaur Bhamra , Shane Luna , Ravi Ramachandra , Eugene Zen , Harrison Kim , Daniel Crawl , Ismael Perez , Ilkay Altintas

分类：计算机视觉

2022-12-29

Research has shown that climate change creates warmer temperatures and drier conditions, leading to longer wildfire seasons and increased wildfire risks in the United States. These factors have in turn led to increases in the frequency, extent, and severity of wildfires in recent years. Given the danger posed by wildland fires to people, property, wildlife, and the environment, there is an urgency to provide tools for effective wildfire management. Early detection of wildfires is essential to minimizing potentially catastrophic destruction. In this paper, we present our work on integrating multiple data sources in SmokeyNet, a deep learning model using spatio-temporal information to detect smoke from wildland fires. Camera image data is integrated with weather sensor measurements and processed by SmokeyNet to create a multimodal wildland fire smoke detection system. We present our results comparing performance in terms of both accuracy and time-to-detection for multimodal data vs. a single data source. With a time-to-detection of only a few minutes, SmokeyNet can serve as an automated early notification system, providing a useful tool in the fight against destructive wildfires.

translated by 谷歌翻译